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Foreword 

This  manual  describes  the  new  syntax  of  Crystal,  a  strongly- typed  lexically-scoped  functional  language 
for  programming  maissively  parallel  naachines,  and  an  implementation  of  a  Crystal  interpreter  based 
on  T,  a  dialect  of  Scheme.  It  is  a  reference  manual,  intended  to  define  the  language  for  which  the 
interpreter  is  implemented.  It  is  not  intended  as  an  introduction  or  a  tutorial.  The  reader  is  expected 
to  have  some  basic  knowledge  about  Crystal  [1,  2). 


1  Language  Definition 

In  this  section,  we  describe  the  syntactic  structures  and  their  associated  meanings  of  Crystal.  Examples 
are  given  for  illustration. 


1.1  Notational  Convention 

We  follow  the  conventions  below  in  describing  the  syntax  of  Crystal. 


[a]  a  is  optional 

{a}  zero  or  more  repetition  of  a 
a  I  6  choice  between  either  a  or  ft 
a  actual  text  in  the  program 

(a)  grouping 

This  work  bu  been  supported  in  part  by  the  Office  of  Moral  Reaearcb  under  Contract  No.  N00014-90-J-1987  and  the 
Science  Foundation  under  Contract  No.  622A-31-47097. 
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2  P  Langu^e  Definition 

1.2  Programs  and  Definitions 

A  Crystal  program  cq^atains  a  set  of  possibly  matually  recursive  definitions  and  optional  ontput  expres¬ 
sions,  haying  the  syntax: 

'  program (definition  [doc]  \  ? expression)  {definition  [doc]  \  ? expression} 
definition  -*  identifier  [:  type-exp]  ■  expression 
doc  -*  docC  anything  except  a  right  curly  brace  > 

The  optional  type-exp  in  the  definition  specifies  the  type  of  the  identifier.  Each  1  expression  is  an  output 
expression  whose  value  will  be  printed  out. 

Example  Below  is  an  example  of  a  simple  Crystal  program: 
n  »  10 

D  =  intervald.n) 

F  »  df  (x):D  {2  *  x} 

?F 

?D 


1.3  Expressions 

Expressions  are  the  most  basic  construct  in  Crystal  programs.  It  is  used  to  express  all  the  semantic 
objects  of  Crystal.  The  basic  semantic  objects  of  Crystal  consist  of  integers,  floating  point  numbers, 
booleans,  and  strings.  The  composite  semantic  objectsof  Crystal  consists  of  tuples,  tagged  values,  index 
domains,  data  fields,  lists,  sets,  and  functions  over  these  objects.  Expressions  have  the  following  syntax: 


expression  — ►  constant  |  identifier  |  tuple  |  tagged-exp  |  expression  :  type-exp 
I  unary-op  expression  \  expression  binary-op  expression 
I  conditional  |  function-abst  |  function-appl 
I  domain-exp  |  dfield-exp  j  list-exp  |  set-exp 
I  (  expression  )  \  expression  «here'{{de^nttton}} 

The  nonterminals  constant  and  identifier  wiU  be  defined  in  Section  1.11.  The  tuple  and  tagged-exp 
are  tuples  and  tagged  values,  which  will  be  defined  in  Section  1.5.  The  expression  expression  :  type-exp 
is  a  typed  expression  in  which  a  type  is  explicitly  specified  for  the  expression;  it  will  be  defined  in 
Section  1.10.  The  unary-op  is  the  family  of  unary  prefix  operators  and  the  binary-op  is  the  family  of 
binary  infix  operator;  they  will  be  defined  in  Section  l.ll.  The  conditionalis  the  conditional  expression, 
which  will  be  defined  in  Section  1.4.  The  function-abst  and  function-appl  are  function  abstraction  and 
application,  which  will  be  defined  in  Section  1.6.  The  domain-exp  is  for  index  domain  expressions, 
which  will  be  defined  in  Section  1.7.  The  dfield-exp  is  for  data  field  expressions,  which  will  be  defined  in 
Section  1.8.  The  list-exp  is  for  list  expressions  and  the  set-exp  is  for  set  expressions;  they  will  be  defined 
in  Section  1.9.  The  (...)  is  for  grouping.  The  . .}  construct  is  for  introducing  local  definitions 

for  identifiers  occurring  in  the  preceding  expression. 


1.4  Conditional 

The  conditional  expression  has  the  form: 
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if  expression  then  expression 
{ 1 1  expression  then  expression} 

[[II]  else  expression] 
fi 

The  expressions  before  then  are  called  guards  and  those  after  then  are  called  consequences.  Guards 
should  denote  boolean  values.  The  expression  after  else  is  called  the  default  expression,  which  is 
optional.  The  value  of  a  conditional  expression  is  the  first  consequence  with  a  true  guard.  When  there 
is  no  true  guard,  the  value  of  the  conditional  is  the  default  expression  if  there  is  one,  otherwise  it  is 
undefined. 

Example  Below  is  a  conditional  expression  that  picks  out  the  maximum  among  x,  y,  and  z: 

if  X  >  y  and  x  >  z  then  x 
1 1  y  >  z  then  y 
1 1  else  z 
fi 


1.5  Tuples  and  Tagged  Values 

Tuples  are  used  for  expressing  elements  of  product  types  or  product  domains  (explained  later).  Tagged 
values  are  used  for  expressing  elements  of  summed  types  or  summed  domains  (explained  later).  They 
have  the  following  syntax: 

tuple  —»  (  [expression  ,  expression  {,  expression}]  ) 
tagged-exp  —*  tag"  tuple 
tag  —*  expression 

There  is  no  single-element  tuple.  They  are  considered  to  be  equivalent  as  the  element  itself.  In  the 
expression  tag"  tuple,  the  tag  should  denote  a  natural  number,  and  the  length  of  tuple  (defined  below) 
should  be  the  same  as  the  number  of  constituent  domains  of  the  summed  domain. 

Let  t  and  u  be  the  values  denoted  by  tag  and  tuple,  respectively,  and  the  product  domain  doX  . ..  x 
dn-i  be  the  type  of  tuple.  The  meaning  of  tag"  tuple  is  t(u(t)),  where  u{t)  is  the  t-th  element  of  the  tuple 
u  and  L  is  the  injection  function  from  the  domain  d|,0  <  t  <  n,  to  the  summed  domain  do  +  •  •  •  +  dn-i- 
The  syntax  for  the  tagged  expression  is  somewhat  unconventional,  but  it  provides  more  information 
for  the  analysis  performed  by  the  parallelizing  compiler.  It  also  makes  expressions  that  depend  on  the 
value  of  the  tag  more  compact,  as  is  shown  in  the  defintion  for  f3  in  the  example  in  section  1.6. 

Selectors  and  Operators  for  Tuples  and  Tagged  Expressions 

Let  e,eo,ei,e2, . .  ■  be  expressions.  The  following  selectors  and  operators  are  defined  for  tuples  and 
tagged  expressions: 

(eo, Cl,  •  •  • , Cn-i )  CO  denotes  the  value  of  e,-,  where  0  <  i  <  n. 

length(co,ci, . .  .,e„_i)  denotes  n. 

tag.of  (c“(eo,ci, . .  .,e„_i))  denotes  the  value  of  e. 

value.of  (c‘(eo,ci, . .  .,e„_i))  denotes  the  value  of  (eo,ei»---,«n-i)- 
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Example  Below  are  examples  of  tuples  and  tagged  values: 

tup  ■  (1,2,3) 
tv  ■  I'tup 

We  will  have  the  following:  tup(O)  is  1,  langthCtup)  is  3,  tag-of  (tv)  is  1,  and  valua^f  (tv)  is  (1,2,3). 

1.6  Function  Abstraction  and  Application 

Function  abstraction  and  application  have  the  following  syntax: 

funciion-abst  — ►  fn  formal  :  type-exp  [|  filter]  {  expression  } 
function-appl  expression  (  expression  )  |  expression  tuple 
formal  —*  identifier  ]  (  [identifier  { ,  identifier)  ]  )  |  formal  “  formal 
filter  — ♦  expression 

Function  abstraction  is  used  for  expressing  functions,  which  are  first  class  objects.  A  filter  is  an  expres¬ 
sion  denoting  a  boolean  value.  When  there  is  a  filter  in  the  function  abstraction,  the  actual  argument  is 
checked  against  the  filter.  If  it  does  not  pass  the  filter  then  the  returned  value  is  undefined.  The  formal 
can  be  of  the  form  of  a  single  identifier,  a  tuple  of  identifiers,  or  identifiers  in  a  tagged  expression  form. 
The  actual  argument  will  be  destructured  properly  according  to  the  form  of  the  formal. 

Example  Below  are  examples  of  functions: 
fl  *  fn  x:nat  {  x*2  } 

f2  =  fn  (x,y)  :  prod.t7p«(nat,nat)  {  x  ♦  y  } 
f3  »  fn  (t‘(u,v))  :  8uai_typ«(bool,nat)  {  t*(not  u,  v  *  3)  } 
fac  »  fn  n  :  int  I  n  >■  0  { 
if  n  «  0  then  1 
1 1  else  n  *  fac(n-l) 

} 

We  will  have  the  following:  fl(3)  is  6,  f2(2,3)  is  5,  f 3(0“ (true, nil))  is  ti(false),  where  ii  is  the 
injection  function  from  bool  to  8uiB-type(bool.nat);  and  f3(l‘(nil,3))  is  13(9),  where  12  is  the 
injection  function  from  nat  to  sum-type  (bool, nat).  The  constant  nil  is  used  to  express  an  undefined 
value.  The  function  fac  is  the  usual  factorial  function,  in  which  a  filter  is  used  to  make  sure  the 
argument  is  non-negative. 

1.7  Index  Domains 

An  index  domain  is  a  set  of  points  indexed  in  some  way.  Index  domain  expressions  have  the  following 
syntax: 

domain-exp  —*  iiitair9al(.expres8ion,expression) 

I  dom{  expression  {;  expression)  } 

\  pxo±.domidomain-etp, domain-exp  {, domain-exp)  ) 

I  auB^om(,domain-expfdomain-exp  {, domain-exp)  ) 

I  gen-prodjdmi /ormol  :  domain-exp  [I  filter]  {  domain-exp  } 
j  g8n.8um  formal  :  domain-exp  [  I  filter]  {  domain-exp  > 

I  domC  formal :  domain-exp  i  filter  j 
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Let  ei,  62,  •••,€„  be  expressions  denoting  integer  values.  An  interval  index  domain  is  constructed  by 
intervaKci  ,62),  with  ci  being  its  lower  bound  and  62  being  its  upper  bound.  It  contains  62  -  ci  +  1 
points  indexed  from  €i  to  62*  If  62  <  ei,  then  int«rval(ei,e2)  is  empty.  An  enumerated  index  domain 
is  constructed  by  dom-fei ; e2 : . . . ; Cn}.  It  contains  n  points  indexed  by  the  values  of  ei ,  e„.  Having 
duplicate  elements  is  permitted  and  data  fields  defined  over  them  will  have  only  one  value  for  the  same 
index.  Note  that  there  should  be  no  space  between  dom  and  {. 

Let  do,  ...  ,dn  be  index  domain  expressions.  The  product  domain  and  sum  domain  of  do,. . .,  d^  are 

denoted  by  the  expressions  prod_dom(do>. . .  ,d„)  and  «uiii_dom(do . d„),  respectively.  Let  Xq  be  an 

expression  denoting  an  element  in  do,  and  2„  an  element  of  dn,  the  tuple  (.xq,.  . .  ,Zn)  is  an  element 
of  prod.dom(do,. . .  ,d„).  The  tagged  expressions  0*(xo,nil,. . .  .nil),. . .,  n‘(nil,. . .  ,nil,i„)  are 
elements  of  sunudomCdo,. . .  ,d„). 

Let  z  be  a  formal,  d  be  a  domain  expression,  e  be  another  domain  expression  with  occurrences  of  x, 
and  /  be  a  filter  (i.e.,  a  boolean  expression)  with  occurrences  of  i.  The  gen-prod-dom  and  gen_sum_dom 
are  for  constructing  general  product  and  sum  domains.  The  expression  gen_prod_doin  x :  die}  denotes 
the  product  domain  of  all  domains  produced  by  instantiating  the  formal  i  in  e  with  every  point  in  the 
domain  d.  When  there  is  a  filter  expression  /,  x  is  only  instantiated  with  points  in  d  that  pass  /  (i.e. 
/  evaluates  to  true).  Similarly  for  genjunudom. 

Let  x,d,f  be  as  described  above.  The  expression  dom[z:d|/]  is  a  domain  restriction  denoting  the 
restricted  domain  that  contains  only  points  of  d  that  pass  the  filter.  Note  that  there  should  be  no  space 
between  dom  and  [. 

Operators  on  Index  Domains 

Let  d  be  a  domain  expression,  and  c  be  an  expression  denoting  an  domain  element  (i.e.,  an  integer,  a 
tuple,  or  a  tagged  expression),  the  following  functions  are  provided: 

card(d)  The  cardinality  of  d. 

contains?(d,e)  True  if  d  contains  e  as  an  element.  False  otherwise. 

lower_bound(d),  upper-boimd(d)  If  d  is  an  interval  domain  then  they  return  the  lower  and  upper 
bounds  of  d.  Otherwise  they  are  undefined. 

Example  Below  are  examples  of  domain  expressions. 

dl  »  interval(l,2) 
d2  »  dom{4;6} 
d3  =  prod.domCdl ,d2) 
d4  =  sum-dom(dl,d2) 

dS  «  gen-prod-dom  (x,y):d3  I  eTsaTCx+y)  {interval (x,y)> 
d6  »  gen-suffl-dom  (x“y)  :d4|  (x«*l){interval(0,y(x))} 
d7  *  dom[(x,y)  :prod_dom(dl,dl)  I  (x  <•  y)] 

The  domain  dl  contains  points  indexed  by  the  set  {1,2}.  The  domain  d2  contains  points  indexed 
by  the  set  {4,6}.  The  domain  d3  contains  points  indexed  by  the  set  {(1,4),  (1,6),  (2,4),  (2,6)}. 
The  domain  d4  contains  points  indexed  by  the  set  {  0“(l,nil),  0"(2,nil),  l“(nil,4),  l“(nil,6)  }.  The 
domain  d5  is  equivalent  to  prodjdom(interval(2,4) .lnterval(2,6)).  The  domain  d6  is  equivalent 
to  sum-dom(interval(0,4) ,  interval (0,6)).  The  domain  d7  contains  points  indexed  by  the  set 
{(1,1).(1,2),(2,2)}. 


[16:01  March  24,  1991] 


6 


1  Language  Definition 


1.8  Data  Fields 

A  data  field  is  a  function  over  some  index  domain.  It  assigns  each  point  in  the  index  domain  a  value. 
Therefore,  a  data  field  can  be  viewed  as  a  set  of  values  Indexed  by  the  underlying  domain.  Data  field 
expressions  have  the  following  syntax: 

dfield-exp  -*  expression  |  |  )  expression}  } 

I  df  formal  :  domain-exp  [  I  filter]  {  expression  } 

I  df  C  expression  |  formal  :  domain-exp  [  |  filter]  ] 

Let  eo,  Cl,  ...,  e„_i  be  expressions,  an  enumerated  one-dimensional  data  field  is  constructed  by 
df{eo:  6] ;  ...;  e„_i}.  This  data  field  is  implicitly  defined  over  the  domain  intervalCO.n  -  1). 
Let  the  identifier  a  be  bound  to  such  a  one-dimensional  data  field,  then  a(i)  denotes  the  value  of 
e,,  where  0  <  t  <  n.  A  two-dimensional  enumerated  data  field  of  size  m  by  n  is  expressed  by 
dom{eo,o:  •••;  eo.n-i;:  •••:;  efn-i.i:  •••;  This  data  field  is  implicitly  defined  over 

prod_dom(interval(0,m  -  1)  , interval (0, n  —  1)).  Notice  that  each  row  must  be  of  equal  length. 
Again,  let  a  be  bound  to  such  a  data  field,  then  a(»,y)  denotes  the  value  of  Cjj,  where  0  <  t  <  m,0  < 
j  <  n.  Similarly,  we  use  ; ; ;  for  three-dimensional  data  fields,  where  each  page  must  be  of  equal  size. 
Note  that  there  is  no  space  between  df  and 

Let  I  be  a  formal,  d  be  a  domain  expression,  e  be  an  expression  with  occurrences  of  x,  and  /  be 
a  boolean  expression  with  occurrences  of  x.  The  expression  df  {  c  }  is  a  data  field  which  assigns 
every  index  point  a:  in  d  with  the  value  denoted  by  c.  When  there  is  a  filter  /,  only  index  points  passing 
/  are  assigned  a  value.  Values  on  index  points  failing  /  are  undefined.  The  expression  df  [e  I  x:d]  is  just 
another  syntactic  form  for  df  x;d  {  e  >.  Note  that  there  is  no  space  between  df  and  [. 

Operators  on  Data  Fields 

Let  a  be  an  expression  denoting  a  data  field  having  values  01,03, . .  .,o„;  p  be  an  expression  denoting  a 
binary  associated  function;  and  i  be  the  identity  element  of  g.  The  following  functions  are  provided: 

domain.of  (a)  It  denotes  the  domain  over  which  o  is  defined, 
card  (a)  It  denotes  the  cardinality  of  domain-of  (o). 

compact  (a)  It  denotes  the  data  field  defined  with  the  same  values  on  a  new,  compacted  domain  with 
undefined  values  and  their  associated  index  points  purged. 

reduce (p  ,i  ,a)  It  denotes  the  reduction  of  g  over  o.  The  value  of  the  reduction  is  equivalent  to 
g{-  ■  ■gig{i,ai),a2) . . .).  When  a  is  empty,  the  value  of  the  reduction  is  defined  to  be  t,  the 
identity  element  of  y  as  mentioned  above. 

scan (jf, 2, a)  It  denotes  the  data  field  of  scanning  g  over  a,  where  t  is  as  defined  above.  If  a  is  not 
one-dimensional,  its  canonical  linear  ordering  is  assumed  for  the  scan. 

Example  Below  are  examples  of  data  field  expressions. 

al  ■  df{l;  2;  3}  !!  1-d  anuaeratad  data  fiald 

a2  •  df{l:  2;;  3;  4}  !!  2-d  anumaratad  data  fiald 

a3  ■  df{l;  2;;  3;  4;;;  5;6:;  7;8}  !!  3-d  anumaratad  data  fiald 

d  «  prodjdom(intarval(0,2) ,intarval(0,2)) 
a4  »  df  (i,j):d  {  if  i  ■■  0  than  al(j)  alaa  a4(i-l,j)  fi  } 
aS  ■  df [x*yl(i,j):d] 
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The  data  field  al  is  the  set  of  values  { 1,2,3}  indexed  by  the  domain  {0,1,2}  and,  for  example,  al(0)  is 
1.  The  data  field  a2  is  the  set  of  values  { 1, 2,3,4}  indexed  by  the  domain  {(0,0),(0,1),(1,0),(1,1)}  and, 
for  example,  a2(l,i)  is  4.  The  data  field  a3  is  the  set  of  values  { 1,2,.  ■  .,8}  indexed  by  the  domain 
{  (0,0, 0),(0, 0,1),. .  .,(1,1,1)}  and,  for  example,  a3(l,0,l)  is  6.  The  data  field  a4  is  the  set  of  values 
{  1,2,3, 1,2, 3,1, 2,3}  indexed  by  the  domain  {(0,0),(0,1),...,(2,2)};  a4(0,l)  is  2.  The  data  field  aS  is  the 
set  of  values  {  0,0, 0,1, 2, 3, 2,4, 6}  indexed  by  the  dommn  {(0,0),(0,1),. .  .,(2,2)}  and,  for  example,  aSCl  ,2) 
is  2. 

1.9  Lists  and  Sets 

Lists  and  sets  are  available  in  Crystal  and  have  the  following  syntax: 

list-exp  — ►  li8t{  [expression  {;  expression}]  } 
set-exp  — ►  set-C  [expression  {;  egression}]  } 

I  set  formal  :  domain-exp  [I  filter]  {  expression  } 

Let  ej , . . . ,  e^  be  expressions.  The  expression  list-C  ej, . . . ,  e^  denotes  a  list  with  enumerated  elements 
ei, . .  .,e„.  Similarly  for  the  expression  8et{  ei,. 

Let  X  be  a  formal,  d  be  a  domain  expression,  e  be  an  expression  with  occurrences  of  i,  and  /  be 
a  boolean  expression  with  occurrences  of  x.  The  expression  set  i:d  •(  e  }  denotes  the  set  containing 
the  values  denoted  by  instantiating  e  with  every  index  point  x  in  d.  When  there  is  a  filter  /,  only  the 
values  of  those  index  points  passing  /  are  contained  in  the  set. 

Operators  on  Lists  and  Sets 

Let  be  expressions  denoting  lists,  e  be  a  general  expression,  and  /  be  an  expression  denoting  a 

function.  The  following  operators  are  defined. 

null?(/)  True  if  I  is  null,  false  otherwise. 

contains?  (e,f)  True  if  I  contains  e  as  an  element,  false  otherwise. 

lengthd)  The  number  of  elements  in  1. 

head(/)  The  first  element  of  1. 

taild)  The  rest  of  I  excluding  the  first  element. 

cons(e,/)  The  list  with  e  being  its  head  and  I  being  its  tail. 

append(/i,/2)  Append  Ij  to  fj.  This  is  the  same  as  “/i  : :  /2”. 

map(/, /)  The  list  of  applying  /  to  every  element  of  /. 

Let  s,Si,S2  be  expressions  denoting  sets,  e  and  /  be  as  described  above.  The  following  operators  are 
defined. 

empty? (s)  True  if  s  is  null,  false  otherwise. 

contains?  (e,/)  True  if  s  contains  e  as  an  element,  false  otherwise. 
card(s)  It  denotes  the  number  of  elements  in  s. 
union(si,S2)  The  union  of  si  and  Sj. 
inter(ai,S2)  The  intersection  of  si  and  sj. 


[16:01  March  24,  1991] 


8 


1  Language  Definition 


diff  (51,33)  The  difference  of  5i  and  53. 
product  (51,53)  The  product  of  5i  and  53. 
map(/,5)  The  set  of  applying  /  to  every  element  of  5. 

Example  Below  are  examples  of  list  and  set  expressions. 

1  =  2;  3} 

si  s  set{l;  2;  3} 

s2  3  set  X  :  interval ( 1 . 10)  I  even?(z)  {  z  3  } 

The  list  1  and  the  set  si  contain  elements  1,2,  and  3.  The  set  82  is  {5,7,9,11,13}. 

1.10  Type  Expressions 

Type  expressions  provide  extra  information  for  static  type  checking,  which  catches  static  type  errors 
at  compile  time  and  reduces  the  chance  of  hitting  a  runtime  error  during  program  execution.  Though 
specifying  the  type  of  expressions  is  optional  in  most  cases,  programmers  are  encouraged  to  provide 
type  information  whenever  possible.  Type  expressions  have  the  following  syntax: 

type-exp  — »  constant-type  |  domain-exp  ]  list-of  ( type-e^)  j  set_of  (type-exp) 

I  sum-typa ( type-exp, type-ezp{. type-exp}) 

I  pro<L.typ«(type-eip,type-exp{,typc-exp}) 

1  lun_typ«  ( type-exp ,  type-exp) 

I  dfieldLtjp«(d'omam-exp,type-ezp) 
constant-type  -*  bool  |  nat  |  int  |  float )  string  |  char  |  dooain  |  type 

The  reserved  words  bool,  nat,  int,  float,  string,  and  char  denote  the  constant  types  of  boolean, 
natural  numbers,  integers,  floating  point  numbers,  character  strings,  and  characters,  respectively.  The 
reserve  words  domain  denotes  the  type  of  domains,  and  t3rp«  denotes  the  universal  type.  Let  t,  ti, . . . , 
be  expressions  denoting  some  type,  and  d  be  a  domain  expression.  The  type  expression  list-of  (1) 
denotes  the  type  of  all  lists  with  elements  of  type  t.  Similarly  for  set-of  (t).  The  type  expression 

prod_type(ti ,. . .  ,t„)  denotes  the  product  type  of  tj Similarly  for  sum-type (ti . t„).  The 

expression  df  ield-type(d,t)  denotes  the  type  of  data  fields  defined  over  d  with  values  of  type  t,  and 
fun-type(ti,t3)  denotes  the  function  type  from  ti  to  <3. 

1.11  Basic  Lexical  Definitions 

Here  we  describe  the  syntax  of  identifiers,  boolean,  integers,  floating  point  numbers,  and  strings.  We 
also  describe  the  unary  prefix  and  binary  infix  operators  for  them. 

unary-op  not  |  - 

binary-op  |  |  ♦  1  /  |  ♦•  |  «od  |  div  |  <  |  >  |  <■  |  >■  j  ■»  |  <> 

and  I  or  I  : :  |  • 

constant  -*  nil  |  boolean  |  numeric  j  character  |  string 
boolean  true  |  false 
numeric  -♦  integer  \  float 
integer  -»  [~]digit{digit} 

float  ([-]dtytt{dtytt}.{digit})  I  ([-]{(f^'t}.dtytt{dtytt}) 
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digit  0  I  1 1  ...  I  9 

identifier  alpha{alpha  \  digit  |  specicd) 

alpha  a|A|b|B|...  jz|Z 

special  -♦_|?|X|||ft|| 

character  -*  i  any  character 

string  -+  "  anything  but  double  quote  *' 

The  unary  prefix  operator  not  is  the  boolean  negation  and  -  is  the  numeric  minus  sign.  The  binary 
infix  operators  +,  *,  /,  and  ••  are  the  numeric  additio.^,  subtraction,  multiplication,  division,  and 

exponentiation;  mod  and  div  are  the  integer  modulo  and  integer  division;  <,  >,  <»,  >®,  ==,  and  <> 
are  the  numeric  comparison  operators  for  less  than,  greater  than,  less  or  equal  to,  greater  or  equal  to, 
equality  testing,  and  inequality  testing;  and  and  or  are  the  boole2m  and  and  or;  : :  is  for  string  and  list 
concatenation  (i.e.,  appending);  C  is  the  function  composition  operator.  All  binary  infix  operators  are 
left  associative  and  their  relative  precedences  are  defined  in  a  way  similar  to  the  language  C.  Users  are 
encouraged  to  use  parentheses  to  disambiguate  an  expression  whenever  in  doubt. 

A  string  consists  of  all  the  characters  appearing  between  2  double  quotes  ("),  taken  as  such,  with 
the  exception  of  the  backslash  character  (\),  which  serves  as  an  escape.  The  following  escapes  are 
defined:  \b  (backspace),  \f  (form  feed),  \n  (newline),  \r  (carriage  return),  \t  (horizontal  tabulate),  \v 
(vertical  tabulate),  \\  (backslash),  \"  (double  quote),  \000  to  \2S5  (character  whose  ASCII  code  is  the 
decimal  number  appearing  after  the  backslash  (3  di^ts  are  required),  \“c  (character  corresponding  to 
control-c  (where  cis  any  character  that  can  be  "controlled”)),  \. .  .\  (all  the  characters  (only  formatting 
characters  are  allowed)  between  the  two  backslashs  are  ignored  (as  well  as  the  backslashs  off  course). 
Examples  of  formatting  characters  are:  space,  newline,  tab).  For  non  alphanumeric  character  that 
don’t  appear  in  the  above  list,  the  backslash  doesn’t  change  anything.  This  convention  for  string 
representation  is  taken  from  Standard  ML. 

Characters  are  represented  by  prepending  a  f  in  front  of  the  representation  of  the  character  in  a 
string.  For  example,  ta  is  the  character  a,  iXb  is  the  backspace  character,  t\255  is  the  character  whose 
ASCII  code  is  255  in  decimal.  Note  that  •#  represents  the  character  #. 


1.12  Built-in  Numeric  Functions 

Let  1,11,12,. .  .,Xn  be  expressions  den''ting  numeric  values,  and  •  •,*»»  he  expressions  denoting 

integers.  The  following  numeric  functions  are  built-in: 

Predicates:  even?(t),  odd?(i)  (predicates  for  integers  testing  even  and  odd). 

Arithmetics  Functions:  addCxj, . .  .,Xn),  subtract (xi,X2)»  »ultiply(xi, . .  .,x„),  divideCxi,  12), 
negate (i),  remainder (11,12),  absCx),  gcd(ii,t2),  min(xi,. .  max(xi, . .  .,i„),  round(i) 
(round  to  nearest  integer),  floor(x),  and  ceiling(x). 

Transcendental  Functions:  exp(x)  (exponential  function  c*),  log(x)  (base  2  logarithm),  ln(x) 
(natural  logarithm),  sinCx),  cosCx),  tan(z),  asinCx),  acosCx),  and  atan(x). 

Bitwise  Logical  Operators:  logand(ti,t2)  (bitwise  logical  and  of  ii  and  12),  logior(ti,t2)  (bitwise 
logical  inclusive  or  of  u  and  12),  logxor  (ii,  *2)  (bitwise  lo^cal  exclusive  or  of  ii  and  12),  lognot  (i) 
(bitwise  logical  not  of  i). 
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2  Using  the  Crystal  Interpreter 

An  interpreter  based  on  T  3.1  has  been  implemented  for  the  language  defined  in  the  previous  section. 
It  is  available  for  anonymous  ftp  from  /pub/Crystal/crystal-int-3.0.tar.Z  on  cs.yale.edu.  This  section 
describes  how  to  use  the  interpreter  and  reminds  you  some  potential  problems  in  using  it. 

2.1  Invoking  the  Interpreter 

Please  follow  the  steps  below  to  invoke  the  interpreter. 

•  Set  up  your  shell  environment  variable  CRYSTAL  to  the  directory  containing  the  T  object  codes  of 
the  Crystal  interpreter.  If  you  are  at  Yale  CS  Department  and  have  access  to  the  /cs /homes  tree, 
then  do  the  following: 

setenv  CRYSTAL  /cs/homea/aysteam/ayang/crystal/obj 

•  Invoke  T  (either  from  csh  or  from  Emacs). 

•  Issue  (load  ’(crystal  crlnt))  to  load  in  the  Crystal  interpreter. 

•  Issue  (cr:repl)  to  enter  the  read-eval- print-loop  of  the  Crystal  interpreter. 

2.2  Interpreter  Environment 

The  read-eval-print  loop  of  the  interpreter  provides  users  with  an  interactive  environment.  Users  can 
enter  definitions,  output  expressions,  or  both,  after  the  prompt  i:,nd  the  result  of  the  output  expressions 
will  be  printed  out.  The  following  commands  are  also  recognized  by  the  read-eval-print-loop  of  the 
Crystal  interpreter: 

loadC'f ilename")  Load  in  the  definitions  in  the  Crystal  program  in  file  filename 

and  evaluate  all  the  output  expressions, 
exit  and  quit  Leave  the  loop  and  get  back  to  T. 

help  Print  a  short  message  about  the  interpreter  and  about  how  to  enter 

multiline  input. 


2.3  Caveats 

Robustness:  This  interpreter  is  still  immature  in  terms  of  error  recovery  and  robustness.  Syntax  errors 
are  caught  by  the  parser  and  the  parsing  will  be  halted  immediately.  The  precise  location  of  the 
offending  token  is  reported.  Run  time  errors  are  left  to  T.  When  hitting  a  run  time  error,  the 
execution  will  break  into  T  with  little  clue  to  help  you  to  figure  out  what  went  wrong.  Using 
(backtrace)  to  inspect  the  call  stack  is  the  most  effective  way  for  finding  out  what  had  gone 
wrong.  Then  use  (reset)  and  then  (cr:repl)  to  get  back  to  the  interpreter. 

Size  of  Index  Domains:  Because  of  the  way  this  interpreter  implements  index  domains  and  data 
fields,  their  sizes  should  not  be  too  big  (over  lOK  elements).  Otherwise,  your  program  execution 
probably  will  experience  many  garbage  collection  cycles. 

Case  of  identifiers:  Because  the  implementation  is  based  on  T,  this  interpreter  inherits  the  limitation 
of  T  that  cases  of  identifiers  are  ignored. 
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Order  of  Definitions:  Theoretically  the  order  of  the  definitions  in  Crystal  programs  should  not  mat¬ 
ter.  However,  because  the  way  interpreter  translates  the  definitions  right  now,  non-function  defi¬ 
nitions  should  be  given  before  they  are  used.  This  restriction  is  expected  to  go  away  in  the  near 
future. 


3  Example  Crystal  Programs 


If  you  are  at  Yale  CS  Department,  there  are  a  collection  of  example  Crystal  programs  available  in 
the  directory  /cs/homes/systems/ayang/crystal /examples.  If  you  have  any  interesting  Crystal  pro¬ 
grams  that  you  would  like  to  share  with  others,  please  copy  them  into  the  world-writable  directory 
/cs/homes/systems/ayang/crystal/contrib. 

These  programs  are  Included  in  the  distribution  tar  file  mentioned  in  the  previous  section.  If  you 
are  not  at  Yale  CS  Department,  please  consult  with  the  person  who  installed  the  interpreter  at  your 
site  about  the  location  of  these  example  programs. 

Below  are  two  programs  pulled  from  the  .../examples  directory.  The  first  one  is  a  simple  matrix 
multiplication  program  using  reduction  on  data  fields.  The  second  one  is  the  bitonic  sort  program,  also 
works  on  data  fields. 


!!!!!!!!!!!!!!!!!! I !!! I !! 1 1 1 1 1  I  !!  1 1 1 !  I !  1 1 !  1 1 1 1 1 !!  I  !!  I  !!!•!!{  I  I  { ! 

!  Matrix  Multiplication  using  reduction. 

!  This  program  illustrates  the  use  of  domains,  data  fields,  and 
the  reduction  over  data  fields. 

!!!!!!!!!•!!!  m;  I  !  1 1 1 1  !  1 1 1 1 1 1 1 1 1 1 1  •  1 1  •  1 1 1  •  1 1  •  1 1  •  1 1  M  I  1 1 1 1  •  I  I  I  I  1 1 


n  *  3 

D  =  interval(0,n-l) 
D2  =  prod_dom(D,D) 


A  =  df{  1; 

2;  3;; 

4;  5;  6;; 

7; 

8;  9 

> 

B  =  df{  1; 

o 

o 

0;  2;  0;; 

0; 

0;  3 

} 

C  *  df(i,j):D2  {  reduceCadd,  0,  (df  k:D  a(i.k)*B(k, j)}))  } 
!!  C  should  be  df{  1;  4;  9;;  4;10;18;;  7;16;27  } 

?  C 
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! !  !  I  !  {  !  1 1 1  1 1 ! !  I  i  I  •  j !  I ! !  1 1 1 1 1 1 1 1 1 1 1 1 !  I !  1 1 ! !  1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1  1 1 1 1  I 

! ! !  Bitonic  Sort .  mi 

!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! 

X  *  df{6;2;5;9;3;l;7;4} 
n  »  card(x) 

logn  ®  ceilingClog(n)) 

d  =  interval (0,n  -  1) 
e  =  interval (0, logn  -  1) 

flip  *  fn  (x.i)  :  prod_type(nat,nat)  <  logxor(x,2**i)  > 

bit  =  fn  (x,i)  :  prod_type(nat,nat)  {  logaod(x,2**i)  <>  0  > 

xor  =  fn  (x.y)  :  prod_type(nat ,nat)  {  <x  or  y)  and  not  (x  and  y)  > 

bsort  =  df  j  :  interval (0. logn  -  1)  •( 
df  (i.k)  :  prod_dom(d,interval<0, j))  < 

if  xor(bit(i.j  +  l),bit(i.J  -  k))  then  aaxCa.b) 
else  min(a,b)  fi 

where{  jprev  *  if  k»«0  then  j  -  1  else  j  fi 

kprev  ■  if  k«*0  then  j  -  1  else  k  -  1  fi 
a  »  if  j>0  then  bsort ( jprev) (i, kprev)  else  x(i)  fi 
b  «  if  j>0  then  bsort ( jprev) (flip (i,j  -  k), kprev) 
else  x(flip(i,0))  fi 

} 

} 

} 

y  =  df [bsortdogn  -  l)(i,logn  -  1)  I  i  :  d] 

?  y 
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